Supporting Multilingual Information Retrieval in Web Applications: An English-Chinese Web Portal Experiment
نویسندگان
چکیده
Cross-language information retrieval (CLIR) and multilingual information retrieval (MLIR) techniques have been widely studied, but they are not often applied to and evaluated for Web applications. In this paper, we present our research in developing and evaluating a multilingual English-Chinese Web portal in the business domain. A dictionary-based approach has been adopted that combines phrasal translation, co-occurrence analysis, and preand post-translation query expansion. The approach was evaluated by domain experts and the results showed that co-occurrence-based phrasal translation achieved a 74.6% improvement in precision when compared with simple word-by-word translation.
منابع مشابه
Supporting Multilingual Internet Searching and Browsing
The amount of non-English information has proliferated rapidly in recent years. The broad diversity of the multilingual content presents a substantial research challenge in the field of knowledge discovery and information retrieval. Therefore there is an increased interest in the development of multilingual systems to support information sharing across languages. The goal of this dissertation i...
متن کاملIntegrating Query Translation and Document Translation in a Cross-language Information Retrieval System
Due to the explosive growth of the WWW, very large multilingual textual resources have motivated the researches in Cross-Language Information Retrieval and online Web Machine Translation. In this paper, the integration of language translation and text processing system is proposed to build a multilingual information system. A distributed English-Chinese system on WWW is introduced to illustrate...
متن کاملCMedPort: An integrated approach to facilitating Chinese medical information seeking
As the number of non-English resources available on the Web is increasing rapidly, developing information retrieval techniques for non-English languages is becoming an urgent and challenging issue. In this research to facilitate information seeking in a multilingual world, we focused on discovering how search-engine techniques developed for English could be generalized for use with other langua...
متن کاملMultilingual Information Retrieval in World Wide Web
The article addresses: (1). The design of an information retrieval (IR), as the Multilingual Information Retrieval Tool Hierarchy (MIRTH), which with virtual corpora on the World Wide Web, also known as Web or WWW. It is motivated by the desire to create a search engine to retrieve information by accessing a virtual. (2). The implementation of a general model of multilingual retrieval for the W...
متن کاملA Multilingual Information Retrieval Tool Hierarchy for a WWW "Virtual Corpus"
The article addresses: 1. the design of an information retrieval (IR) toolkit, named as the Multilingual Information Retrieval Tool Hierarchy (MIRTH) search engine, which works with virtual corpora on the World Wide Web, also known as the Web or WWW for short. It is motivated by the desire to create a multilingual search engine to retrieve information by accessing a virtual corpus; 2. the imple...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2003